Oracle Evaluation of Flexible Adaptive Transforms for Underdetermined Audio Source Separation
نویسندگان
چکیده
We describe and apply a flexible, adaptive cosine packet transform to separate audio sources from instantaneous, underdetermined audio mixtures by time-frequency masking. Previously studied adaptive transform schemes have two main drawbacks: the signal can only be partitioned into dyadic intervals, and the profiles of the overlapping windows are often very short, thus tapering off very quickly. The novel aspects of our new approach are that it admits a much larger library of admissible orthogonal bases, and thus does not require dyadic segmentation and alleviates border artifacts at window boundaries. Oracle estimation, which determines experimental upper performance bounds of our techniques, demonstrates potential performance improvements of up to 3.0 dB SDR, when compared with fixed-basis transforms such as the short-time Fourier transform and modified discrete cosine transform, and the previously studied adaptive cosine packet decomposition scheme.
منابع مشابه
Extension of Sparse, Adaptive Signal Decompositions to Semi-blind Audio Source Separation
We apply sparse, fast and flexible adaptive lapped orthogonal transforms to underdetermined audio source separation using the time-frequency masking framework. This normally requires the sources to overlap as little as possible in the time-frequency plane. In this work, we apply our adaptive transform schemes to the semiblind case, in which the mixing system is already known, but the sources ar...
متن کاملA Study of the Effect of Source Sparsity for Various Transforms on Blind Audio Source Separation Performance
In this paper, the problem of blind separation of underdetermined noisy mixtures of audio sources is considered. The sources are assumed to be sparsely represented in a transform domain. The sparsity of their analysis coefficients is modelled by the Student t distribution. This prior allows for robust Bayesian estimation of the sources, the mixing matrix, the additive noise variance as well as ...
متن کاملFirst Stereo Audio Source Separation Evaluation Campaign: Data, Algorithms and Results
This article provides an overview of the first stereo audio source separation evaluation campaign, organized by the authors. Fifteen underdetermined stereo source separation algorithms have been applied to various audio data, including instantaneous, convolutive and real mixtures of speech or music sources. The data and the algorithms are presented and the estimated source signals are compared ...
متن کاملOn the Use of Latent Mixing Filters in Audio Source Separation
In this paper, we consider the underdetermined convolutive audio source separation (UCASS) problem. In the STFT domain, we consider both source signals and mixing filters as latent random variables, and we propose to estimate each source image, i.e. each individual sourcefilter product, by its posterior mean. Although, this is a quite straightforward application of the Bayesian estimation theor...
متن کاملA Signal-adaptive Local Cosine Transform for Source Separation by Time-frequency Masking
Time-frequency masking is often used for source separation of underdetermined audio mixtures. It depends on the fact that the sources can be represented disjointly in some transform domain. The focus of this paper is on demixing sources from instantaneous, two-channel mixtures by binary masking. We investigate trees of local cosine bases from which a suitable transform may be generated—the best...
متن کامل